Evaluating the Meaning of Synthesized Listener Vocalizations
نویسندگان
چکیده
Spoken and multimodal dialogue systems start to use listener vocalizations for more natural interaction. In a unit selection framework, using a finite set of recorded listener vocalizations, synthesis quality is high but the acoustic variability is limited. As a result, many combinations of segmental form and intended meaning cannot be synthesized. This paper presents an algorithm in the unit selection domain for increasing the range of vocalizations that can be synthesized with a given set of recordings. We investigate whether the approach makes the synthesized vocalizations convey a meaning closer to the intended meaning, using a pairwise comparison perception test. The results partially confirm the hypothesis, indicating that in many cases, the algorithm makes available more appropriate alternatives to the available set of recorded listener vocalizations.
منابع مشابه
Multidimensional meaning annotation of listener vocalizations for synthesis
Listener vocalizations convey affective and epistemic states behind the listener’s intentions while the interlocutor is talking. The meaning annotation of such vocalizations is a crucial step in synthesis of listener vocalizations. This paper presents a perception study to annotate meaning of vocalizations. In this study, subjects annotate (characterize) a set of listener vocalizations using a ...
متن کاملToriyeh: the Way of Escaping from Telling Lies to Patients
Toriyeh means concealing real intention of speech using its parallel and common words so that the listener constructs from speaker's speech a meaning what he/she meant. The purpose of this research is studying jurisprudential dimensions of toriyeh in order to clarify its distinction from lying and related jurisprudential commandments by explanation of the most important discussions about toriye...
متن کاملDetection of Total Syllables and Canonical Syllables in Infant Vocalizations
During the first two years of life, human infants produce increasing numbers of speech-like (canonical) syllables. Both basic research on child speech development and clinical work assessing a child’s pre-speech capabilities stand to benefit from efficient, accurate, and consistent methods for counting the syllables present in a given infant utterance. To date, there have been only a few attemp...
متن کاملIntonation as an interface between language and affect.
The vocal expression of human emotions is embedded within language and the study of intonation has to take into account two interacting levels of information--emotional and semantic meaning. In addition to the discussion of this dual coding system, an extension of Brunswik's lens model is proposed. This model includes the influences of conventions, norms, and display rules (pull effects) and ps...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011